BTCC / BTCC Square / Global Cryptocurrency /
NVIDIA Unveils ProRL v2 to Enhance LLM Reinforcement Learning

NVIDIA Unveils ProRL v2 to Enhance LLM Reinforcement Learning

Published:
2025-08-13 22:11:02
10
3
BTCCSquare news:

NVIDIA has launched ProRL v2, a groundbreaking reinforcement learning framework designed to push the boundaries of large language models (LLMs). The system extends training to over 3,000 RL steps across five domains, leveraging advanced techniques like chain-of-thought prompting and tree search to improve stability and performance.

The innovation introduces KL-regularized trust regions and periodic policy resets, addressing historical instability in prolonged RL training. By testing the limits of extended learning, ProRL v2 aims to unlock new capabilities in LLMs beyond conventional benchmarks.

|Square

Get the BTCC app to start your crypto journey

Get started today Scan to join our 100M+ users